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SYNTHETIC MAMMALIAN a-N-ACETYLGLUCOSAMINTDASE AND 
GENETIC SEQUENCES ENCODING SAME 



FIELD OF THE INVENTION 

The present invention relates generally to mammalian a-N-acetylglucosaminidase and to 
genetic sequences encoding same and to the use of these in the investigation, diagnosis and 
treatment of subjects suspected of or suffering from o-N-acetylglucosaminidase deficiency. 

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the 
nucleotide and amino acid sequences referred to in the specification are defined following 
the bibliography. 

Throughout this specification, unless the context requires otherwise, theword "comprise", 
or variations such as "comprises" or "comprising", will be understood to imply the 
inclusion of a stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer or group of elements or integers. 



BACKGROUND TO THE INVENTION 

The increasing sophistication of recombinant DNA technology is greatly facilitating the 
efficacy of many commercially important industries including areas of medical and 
pharmaceutical research and development. The ability to purify native proteins and 
subsequently clone genetic sequences encoding these proteins is an important first step in 
the development of a range of therapeutic and diagnostic procedures. However, 
practitioners have faced many difficulties in purifying target molecules to an exteni 
sufficient to determine amino acid sequences to permit the development of oligonucleotide 
probes to assist in the cloning of genetic sequences encoding the target molecules. 
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Such difficulties have been particularly faced in the research and development of lysosomal 
enzymes. An important lysosomal enzyme is a-N-acetylglucosaminidase (EC 2.1.50). 
This enzyme acts as a exoglycosidase in lysosomes to hydrolyse the terminal a-N- 
5 acetylglucosamine residues present at the non-reducing terminus of fragments of heparan 
sulphate and heparin (Hopwood, 1989). A deficiency in this lysosomal hydrolase is 
responsible for the pathogenesis of Sanfilippo B (Mucopolysaccharidosis type TTTR [MPS- 
IIIB]) syndrome (von-Figura and Kresse, 1972; O'Brien, 1972). This is an autosomal 
recessive disorder of glycosaminoglycan catabolism leading to storage and excretion of 
10 excessive amounts of heparan sulphate and a variety of clinical phenotypes, but classically 
presenting with progressive mental retardation in conjunction with skeletal deformities 
(McKusick and Neufeld, 1983). 

There is a need, therefore, to purify a-N-acetylglucosaminidase and to clone genetic 
15 sequences encoding same to permit development of a range of therapeutic and diagnostic 
procedures to assist in the diagnosis and treatment of disease conditions arising from a-N- 
acetylglucosaminidase deficiency. 

20 SUMMARY OF THE INVENTION 

One aspect of the invention provides an isolated nucleic acid molecule comprising a 
sequence of nucleotides which encodes or is complementary to a sequence which encodes 
a mammalian a-N-acetylglucx>saminidase or fragment or derivative thereof. 

25 A second aspect of the invention provides an isolated nucleic acid molecule comprising a 
sequence of nucleotides which is capable of hybridising under at least low stringency 
conditions to a nucleotide sequence set forth in SEQ ID NO:l or SEQ ID NO:3 or a 
complementary strand or a homologue, analogue or derivative thereof. 



30 



Another aspect of the invention is directed an isolated nucleic acid molecule which is at 
least 40% identical to the nucleotide sequence set forth in SEQ ID NO:l or SEQ ID 
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NO: 3 or to a complementary strand thereof or a homologue, analogue or derivative 
thereof. 

A further aspect of the present invention provides a nucleic acid molecule comprising a 
5 sequence of nucleotides encoding or complementary to a sequence encoding a polypeptide 
capable of hydrolysing the terminal a-N-acetylglucosamine residues present at the non- 
reducing terminus of fragments of heparan sulphate and heparin residues and wherein said 
nucleotide sequence is capable of hybridising under low stringency conditions to the 
nucleotide sequence set forth in SEQ ID NO: 1 . 

10 

Jjjj A further aspect of the invention is directed to a genetic construct comprising a sense 

5^' molecule, for the expression or over-expression of a-N-acetylglucosaminidase in 

If* prokaryotic or eukaryotic cells. 

: : : 

15 A further aspect of the present invention is directed to synthetic a-N-acetylglucosaminidase 
O — or like molecule. 

fell — . - 

p A further aspect of the invention contemplates antibodies to a-N-acetylglucosaminidase 

^ and preferably synthetic a-N-acetylglucosaminidase or a like molecule. 

20 

In still yet another aspect of the present invention there is contemplated a method of 
diagnosing a mutation or other abberations in the a-N-acetylglucosaminidase gene in a 
human or animal patient. 

25 ~Anotbef~^p^ from a-N- 

acetylglucosaminidase deficiency, such as in MPS-IHB, said method comprising 
administering to said patient an effective amount of a-N-acetylglucosaminidase or active 
like form thereof. 

30 Another aspect of the present invention is directed to a pharmaceutical composition 
comprising a recombinant mammalian a-N-acetylglucosaminidase or an active fragment 
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or derivative thereof and one or more pharmaceutical^ acceptable carriers and/or diluents. 



5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure_l_is_a„photographic_representation of_a-N-acetylglucosaminidase purified from 

human placenta following SDS/PAGE. Lane 1: M, standards (kDa); Lanes 2 and 3 : 
purified a-N-acetylglucosaminidase from human placenta. Lane 4 and 5, bovine serum 
albumin. 

10 

"Figure 2 is a phdtogr^Mc^epresentation of an SDS/polyacrylamide gel showing the 
molecular weights of recombinant a-N-acetylglucosaminidase polypeptides produced in 
CHO cells before (-) and after (+) PNGase F digestion. The 50 mM NaCl and 75 mM 
NaCl fractions are indicated. Molecular weights of a-N-acetylglucosaminidase 

15. polypeptides are indicated on the left of the figure._Molecular_weights of marker proteins 

— are indicated on the right-hand side of the figure (lane 5). 

Single and three letter abbreviations of conventional amino acid residues as used herein are 
defined in Table 1. 

20 

Suitable amino acid substitutions referred to herein are defined in Table 2. 

Codes for non-conventional amino acid residues as used herein are defined in Table 3. 



TABLE 1 



Amino Acid 



Three-letter 
Abbreviation 



One-letter 
Symbol 



10 
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20 



25 



— Alanine — — = — 
Arginine 


Ala 

Arg 


Asparagine 


Asn 


Aspartic acid 


Asp 


Cysteine 


Cys 


Glutamine 


Gin 


Glutamic acid 


Glu 


Glycine 


Gly 


Histidine 


His 


— Isoleucine 


He 


Leucine 


Leu 


Lysine 


Lys 


Methionine 


Met 


Phenylalanine 


Phe 


Proline 


Pro 


Serine 


Ser 


Threonine 


Thr 


Tryptophan 


Trp 


Tyrosine 


Tyr 


Valine 


Val 


Any residue 


Xaa 



— - A— 
R 
N 
D 
C 
Q 

£ 
G 
H 

1 



L 

K 

M 

F 

P 

S 

T 

W 

Y 

V 

X 



TABLE 2 

Suitable residues for amino acid substitutions 



Or i ginal Residue Exemplary S ubstitutions 

Ala Ser 

Arg -Lys 

Asn Gin; His 

Asp Glu 

Cys Ser 

Gin Asn 

Glu Asp 

Gly Pro 

His Asn; Gin 

He Leu; Val 

Leu ^_He;_Val 

Lys~ Arg; Gin; Glu 

Met Leu; De 

Pne Met; Leu; Tyr 

Ser Thr 

Thr Ser 

Tip Tyr 

Tyr Trp; Phe 

Val He; Leu 
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Non-conventional 
amino acid 

5 


Code 


Non-conventional 
amino acid 


Code 


a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


a-amino-a-methylbutyrate 


Mgabu 


L-N-methylarginine 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


1 0 aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


aminonorborayl- 


Norb 


L-N-methylglutamine 


Nmgln 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyclohexylalanine 


Chexa 


L-N-methylhistidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisolleucine 


Nmile 


15 D-alanine 


Dal 


L-N-methylleucine 


Nmleu 


D-arginine 


Darg 


L-N-methyllysine 


Nmlys 


D-aspartic acid 


Dasp 


L-N-methylmethionine 


Nmmet 


D-cysteine 


Dcys 


L-N-methylnorleucine 


Nmnle 


D-glutamine 


Dgln 


L-N-methylnorvaline 


Nmnva 


20 D-glutamic acid 


Dglu 


L-N-methylorni thine 


Nmorn 


D-histidine 


Dhis 


L-N-methylphenylalanine 


Nmphe 


D-isoleucine 


Dile 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


Nmser 


D-lysine 


Dlys 


L-N-methylthreonine 


Nmthr 


25 D-methionine 


Dmet 


L-N-methyltryptophan 


Nmtrp 


D-orni thine 


Dora 


L-N-methyltyrosine 


Nmtyr 


D-phenylalanine 


Dphe 


L-N-methylvaline 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


30 D-threonine 


Dthr 


L-norleucine 


Nle 


D-tryptophan 


Dtrp 


L-norvaline 


Nva 
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D-tyrosine 


Dtyr 


cc-mcthyl-aminoisobutyratc 


Maib 






D-valine 


Dval 


a-methvl-v-aminobutvrate 


ATA ftOt/U 






D-a-methylalaoine 


Dmala 


tz-methvlcvclohexvlalanine 


_ ▼_ vtivAU 






D-a-methylarginine 


Dmarg 


ff-methvlcvlconentvlalaninp 






5 


D-tt-methvlasDarapine 

Jul-* VI 1 T JVVJM_V^^4-Wl VI I jLly 


DmasD 


ft- m p th v 1 - /»- n n n t Vi vl a 1 si n i n*» 

«*"*JJJ.W UJ. jr 1 1* 11_1L/ lu y I** 1 1 H lv 


ivin nap 






D - a-methvlasD2rtate 


Dmasn 

A-' LiilUL/ 


rf-mPthvlnpnioillatTi inp 


ivipcii 






D- a-meth vie vsteine 


Dm cvs 

"■"'111 V* j 0 


1 1 on i j liuu u ijr i j glycine 


iNgiU 






D-a-methvlplutamine 


Dmpln 


^J— ^^-JiminriPthvl^O'lvi^irip 


iNaeg 






D-tt-methvlhistidine 


Dmhis 


W-^-QminnnrnnvlWlvrinp 


in urn 




10 


D-tt-methvlisoleucine 


Dmile 


N*st m i nri— rf-TTiPtHvlhiirvrriit** 

1 ~ Qll 1 1 1 I\J 1* li-lCLlljf 1U U Ijfl die 


INDJiiaDU 


c — : 

io 




D-a-methvlleucine 


Dmleu 


rt- tin n t h vl st 1 si n i tip 


A MMM 

/viiap 


ill 




D-cc-methyllysine 


Dmlys 


N-h p n 7 vl o 1 vc i ne 


INpilC 


In 
h 




D- cc-methylmethionine 


Dmmet 


^J— f2 — r^s* rh si tti vl p tVi vl ^ o 1 vr* i n p 
l^l \£*^*raX Ualliy IwlXljrl ygijr C111C 


in gin 






D-a-methvlorni thine 


Dmom 


^pji rhsi tti vl m p tVi vl ^ o 1 vr* i tip 


iNosn 




15 


D-a-methvlDhenvlalanine 

^ »*» B T 1 ■ 1 1 1 T 11*1 IVl IT 1 *" * f * 1 


Dmnhe 


T 0 — f*5l r Vlf\Y VP tfl vl ^ Q 1 VP iflA 


INglU 






D-a-methvlnroline 

_^ m ti IT II rl 1 "1 II li^r 


Dmnrn 


^ r*JirTir4YvrtiPthvl^tylvr*iiiP 


INclSp 






D- cc-methvlserine 


Dmser 


KT-r»vr 1 nHi i tvl o 1 vr i np 


Mr Hi it 






D-cx-rnethvlthreonine 


Dmthr 


N^-r*vr*lririPnfvl &l vr-irip 


iNuncp 






D-a-methyltryptophan 


Dmtrp 


N-cvclohexvl p1 vcine 






20 


D-a-methyltyrosine 


Dmty 


N-r* vp 1 oH p/^ vl o I vr i n p 


INLAlC^ 






D-a-methylvaline 


Dmval 


N-cvlcododecvlclvcine v 


Ncdod 






D-N-methylalanine 


Dnmala 


N-cvclooctvl I vcine 

A ^ W ¥ WlvvvlJ Itl J III 


Ncoct 






D-N-methylarginine 


Dnmarg 

______ _j 


N-cyclopropylglycine 


Ncpro 






I"}- N— m p th vl si cn a rsi o \ tip 

-_-' l^l lUwUljr IdoUcU aglllw 


FinniQen 


in -cy ciounaecy igiycme 


XT_.. n J 

iNcunci 
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D-N-methylaspartate 


Dnmasp 


N-(2,2-diphenylethyl) 












glycine 


Nbhm 






D-N-methylcysteine 


Dnmcys 


N-(3 ,3-diphenylpropyI) 












glycine 


Nbhe 



u-N-metnylglutamine 


Dnmgln 


N-(3-guanidinopropyl) 






- 


glycine 


Narg 


u-ri -metny Iglu tamate 


Dnmglu 


N-(l -hydroxyethyl)glycine 


Nthr 


D-N-metnylnistidine 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 


D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyl)) 








glycine 


Nhis 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl) 








glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-methyl-y-aminobutyrate 


Nmgabu 


N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


D-N-methylornithine 


Dnmorn 


N-methylcyclopentylalanine 


Nmcpen 


N-methylglycine 


Nala 


D-N-methylphenylalanine 


Dnmphe 


N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 


N-( 1 -methylpropyl)glycine 


Nile 


D-N-methylserine 


Dninser 


N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methylethy l)glycine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 


D-N-methylvaline 


Dnmval 


N-methylpenicUlamine 


Nmpen 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-f-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 


L- a- me thy lar ginine 


Marg 


L-o-methylasparagine 


Masn 


L-a-methylaspartate 


X £ 

Masp 


L-a-methyl-f-butylglycine 


Mtbug 


L-a-methylcysteine 


Mcys 


L-methylethylglycine 


Metg 


L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhomo 








phenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl) 








glycine 


Nmet 


L-a-methylleucine 


Mleu 


L-a-methyllysine 


Mlys 
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L-a-methylmethionine Mmet L-a-methylnorleucine Mnle 

L-o-methylnorvaline Mnva L-a-methylornithine Morn 

_I^-methylphenylaIanine — Mphe L^methylproline .__ . Mpro 

I-ct-methylserine Mser L-o-methylthreonine Mthr 

5 L-o-methyltryptophan Mtrp L-o-methyltyrosine Mtyr 

L-a-methylvaline Mval __^^methylhomo 

phenylalanine Nmhphe 
N-(N-(2,2-diphenylethyl) N-(N-(3,3-diphenylpropyl) 

carbamylmethyl)glycine Nnbhm carbamylmethyl)glycine Nnbhe 
10 1 -carboxy- 1 -(2 ,2-diphenyl- 

ethylamino)cyclopropane Nmbc 



5 



DETAILED DESCRD7TION OF THE PREFERRED EMBODIMENTS 

JThe prwent invention provides an isolated nucleic acid molecule comprising a sequence 
of nucleotides which encodes, or are complementary to a sequence which encodes, a 
mammalian o-N-acetylglucosaminidase or fragment or derivative thereof or its like 
molecule. 



Preferably, the mammal is a human, livestock animal, companion animal, wild animal or 
laboratory test animal (e.g. rabbit, rat, mouse or guinea pig). Most preferably, the mammal 
10 is a human. Conveniently, the o-N-acetylglucosaminidase is isolatable from the liver, 
kidney or placenta. However, the present invention extends to all mammalian o-N- 
acetylglucosaminidase enzymes and from any anatomical or cellular source and/or any 
biological fluid source, such as but not limited to plasma, serum, cell extract or lymph 



fluid. 
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Although a preferred embodiment of the present invention contemplates the use of human 
o-N-acetylglucosaminidase or genomic or recombinant (e.g. cDNA) genetic sequences 
encoding same in the investigation, diagnosis and/or treatment of human subjects (i.e. 
homologous system), one skilled in the art will appreciate that the enzyme or genetic 
20 sequences encoding same from a non-human animal may also be useful. Such a 
heterologous system is encompassed by the present invention. 

The term "nucleic acid molecule" as used herein shall be taken to refer to any RNA or 
DNA (eg. cDNA) molecule, whether single-stranded or double-stranded or in a linear or 
covalently-closed form. The nucleic acid molecule may also be DNA corresponding to the 
entire genomic gene or a substantial portion thereof or a fragment or derivative thereof. 

The nucleic acid molecule of the present invention may constitute solely the nucleotide 
sequence encoding o-N-acetylglucosaminidase or a o-N-acetylglucosaminidase-like 
molecule or may be part of a larger nucleic acid molecule. Accordingly, the present 
invention extends to the isolated genomic o-N-acetylglucosaminidase gene. The non- 
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10 



translated sequences in a larger nucleic acid molecule may include vector, transcriptional 
and/or translation^ regulatory sequences, promoter, terminator, enhancer, replication or 
jignal sequences or non-coding regions (eg intron sequences) of an isolated genomic gene. 

Reference herein to a "gene" is to be taken in its broadest context and includes: 

(i) a classical genomic gene consisting of transcriptional and/or translation^ 
regulatory sequences and/or a coding region and/or non-translated sequences (i.e. 
introns, 5'- and 3'- untranslated sequences); 

(ii) mRNA or cDNA corresponding to the coding regions (i.e. exons) 
optionally comprising 5'- or 3 '-untranslated sequences of the gene; or 

- (iii) synthetic, amplified DNA fragments or other recombinant nucleic acid 
molecules produced in vitro and comprising all or a part of the coding region and/or 
5'- or 3 '-untranslated sequences of the gene. 

15 The term "gene" is also used to describe synthetic or fusion molecules encoding all or 
part of a functional product. A functional product is one which comprises a sequence of 
nucleotides or is complementary to a sequence of nucleotides which encodes a functional 
polypeptide, in particular a polypeptide having the catalytic activity of o-N- 
acetylglucosaminidase or a homologue, analogue or derivative thereof. 

20 

For the present purpose, "homologies" of a nucleotide sequence shall be taken to refer 
to an isolated nucleic acid molecule which is substantially the same as the nucleic acid 
molecule of the present invention or its complementary nucleotide sequence, 
notwithstanding the occurrence within said sequence, of one or more nucleotide 
25 substitutions, insertions, deletions, or rearrangements. 

•Analogues" of a nucleotide sequence set forth herein shall be taken to refer to an 
isolated nucleic acid molecule which is substantially the same as a nucleic acid molecule 
of the present invention or its complementary nucleotide sequence, notwithstanding the 
30 occurrence of any non-nucleotide constituents not normally present in said isolated 
nucleic acid molecule, for example carbohydrates, radiochemicals including 
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radionucleotides, reporter molecules such as, but not limited to DIG, alkaline 
phosphatase or horseradish peroxidase, amongst others. 

"Derivatives" of a nucleotide sequence set forth herein shall be taken to refer to any 
5 isolated nucleic acid molecule which contains significant sequence similarity to said 
sequence or a part thereof. Generally, the nucleotide sequence of the present invention 
may be subjected to mutagenesis to produce single or multiple nucleotide substitutions, 
deletions and/or insertions. Nucleotide insertional derivatives of the nucleotide sequence 
of the present invention include 5' and 3' terminal fusions as well as intra-sequence 

10 insertions of single or multiple nucleotides or nucleotide analogues. Insertional 
nucleotide sequence variants are those in which one or more nucleotides or nucleotide 
analogues are introduced into a predetermined site in the nucleotide sequence of said 
sequence, although random insertion is also possible with suitable screening of the 
resulting product being performed. Deletional variants are characterised by the removal 

15 of one or more nucleotides from the nucleotide sequence. Substitutional nucleotide 
^^J 5 are those in which at least onejnicleotide m the sequence has been removed and 
a different nucleotide or nucleotide analogue inserted in its place. 

Preferably, a homologue, analogue or derivative of an a-N-acetylglucosaminidase gene 
20 according to any embodiments described herein, comprises a sequence of nucleotides of 
at least 10 contiguous nucleotides derived from SEQ ID NO:l or SEQ ID NO:3 or a 
complementary strand thereof, wherein the sequence of said homologue, analogue or 
derivative is at least 40% identical to SEQ ID NO:l or SEQ ID NO:3 or a 
complementary strand thereof or wherein said homologue, analogue or derivative is 
25 capable of hybridising to said sequence under at least low stringency hybridisation 
conditions. 

For the purposes of nomenclature, the nucleotide sequence set for in SEQ ID NO: 1 
relates to the cDNA encoding the human a-N-acetylglucosaminidase enzyme. 

30 

The nucleotide sequence set forth in SEQ ID NO:3 relates to the genomic gene equivalent 
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of the cDNA encoding the human liver a-N-acetylglucosaminidase enzyme. Those 
skilled in the art will be aware that the specific exon sequences described in SEQ ID 
NO:3 correspond to the coding regions of the a-N-acetylglucosaminidase gene, said exon 
regions further comprising the entire open reading frame of the cDNA sequence set forth 
in SEQ ID NO:l, when aligned in a head-to-tail configuration. The intron sequences of 
SEQ ID NO:3, which correspond to non-coding regions of the gene which are spliced 
from the primary transcription product thereof, although not explicitly defined, may be 
readily deduced by those skilled in the art, when provided with the exon sequence data 
provided in the nucleotide sequence listing. 
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The nucleotide sequence of the present invention may correspond to the sequence of the 
naturally-occurring a-N-acetylglu<x>saminidase gene or may comprise a homologue, 
analogue or derivative thereof which contains single or multiple nucleotide substitutions, 
deletions and/or additions. All such homologues, analogue or derivatives encode a-N- 
15 acetylglucosaminidase or o-N-acetylgjucosaminidase-like molecules or a homologue, 
analogue or derivative thereof as contemplated by the present invention. The length of the 
nucleotide sequence may^ary from a few ba^ acid probes or primers, 

to a full length sequence. 

20 The present invention is particularly directed to the nucleic acid in cDNA form and 
particularly when inserted into an expression vector. The expression vector may be 
replicable in a eukaryotic or prokaryotic cell and may either produce mRNA or the mRNA 
may be subsequently translated into a-N-acetylglucosaminidase or like molecule. 
Particularly preferred eukaryotic cells include CHO cells but may be in any other suitable 

25 mammalian cells or cell lines or non-mammalian cells such as yeast or insect cells. 

In an alternative embodiment, the present invention provides a nucleic acid molecule 
comprising a sequence of nucleotides which encodes or are complementary to a sequence 
which encodes a polypeptide capable of hydrolysing the a-N-acetylglucosamine residues 
30 from the non-reducing terminus of heparan sulphate and heparin fragments and wherein 
said nucleotide sequence is capable of hybridising under at least low stringency conditions 
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to a nucleotide sequence set forth in SEQ ID NO:l or SEQ ID NO:3 or a homologue, 
analogue or derivative thereof. 

A second aspect of the invention provides an isolated nucleic acid molecule comprising a 
sequence of nucleotides which is capable of hybridising under at least low stringency 
conditions to a nucleotide sequence set forth in SEQ ID NO. l or SEQ ID NO:3 or a 
complementary strand or a homologue, analogue or derivative thereof. 

Preferably, hybridisation is possible under at least medium stringent conditions. More 
10 preferably, hybridisation is possible under high stringent conditions. 

For the purposes of defining the level of stringency, reference can conveniently be made 
to Sambrook et al (1989) or Ausubel etal.{\9%l) which are herein incorporated by 
reference. 

15 

A low stringency is defined herein as being a hybridisat ion and/or- wash carried out in 4-6X 
.SSC/0 : l-0.5% w /v SDS at 37-45»C Jbr 2-3 hours. A medium stringency hybridisation 
and/or wash is carried out in 1-4X SSC/0.25-0.5% w/v SDS at * 45 °C for 2-3 hours and 
a high stringency hybridisation and/or wash is carried out 0. MX SSC/0. 1% w/v SDS at 
20 60°C for 1-3 hours. 

Alternative conditions of stringency may be employed to those specifically recited herein. 
Generally, the stringency is increased by reducing the concentration of SSC buffer, 
and/or increasing the concentration of SDS and/or increasing the temperature of the 

25 ^hybridisation and/or wash. Those skilled in the art will be aware that the conditions for 
hybridisation and/or wash may vary depending upon the nature of the hybridisation 
membrane or the type of hybridisation probe used. Conditions for hybridisations and 
washes are well understood by one normally skilled in the art. For the purposes of 
clarification of parameters affecting hybridisation between nucleic acid molecules, 

30 reference is found in pages 2.10.8 to 2.10.16. of Ausubel et al. (1987), which is herein 
incorporated by reference. 
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Those skilled in the art will be aware that the nucleotide sequences set forth in SEQ ID 
NO:l and SEQ ID NO:3 may be used to isolate the corresponding genes from other 
_human tissues or altern^ly.J^ thejjs^es or cells of_other_species, without undue 
experimentation. Means for the isolated of such related sequences will also be known 
to those skilled in the art, for example nucleic acid hybridisation, polymerase chain 
reaction, antibody screej^ofexpression libi^es.Junctiohal screening of expression 
libraries, or complementation of mutants, amongst others. The present invention is not 
to be limited by the source from which the specific gene sequences described herein have 
been isolated or by the means used to isolate said sequences. 



— In one embodiment, a-related-genetic-sequence comprising genomic DNA, or mRNA, 
or cDNA is contacted with a hybridisation effective amount of a genetic sequence which 
encodes a-N-acetylglucosaminidase, or its complementary nucleotide sequence or a 
homologue, analogue, derivative or functional part thereof, and then said hybridisation 
IS is detecte d using a suitable detectio n means. 



The related genetic sequence may be in a recombinant form, in a virus particle, 
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the related 
genetic sequence originates from an animal species or a human. More preferably, the 
20 related genetic sequence originates from a human. 

Preferably, the genetic sequence which encodes a-N-acetylglucosaminidase (i.e probe 
or latter genetic sequence) comprises a sequence of nucleotides of at least 10 nucleotides, 
more preferably at least 20 nucleotides, even more preferably at least 50 nucleotides and 
25 even still more preferably-at least 100 nucleotides derived from the nucleotide sequence 
set forth in SEQ ID NO: 1 or SEQ ID NO:3 or a complementary sequence or a 
toroologue, analogue or derivative thereof. 



30 



Preferably, the detection means is a reporter molecule capable of giving an identifiable 
signal (e.g. a radioisotope such as »P or "S or a biotinylated molecule) covalently 
attached to the a-N-ac«tylglucosaniinidase probe. 
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In an alternative embodiment, the detection means is a polymerase chain reaction. 
According to this embodiment, two opposing non-complementary nucleic acid "primer 
molecules" of at least 10 nucleotides in length, more preferably at least 20 nucleotides 
in length, derived from the nucleotide sequence set forth in SEQ ID NO:l or SEQ ID 
5 NO:3 may be contacted with a nucleic acid "template molecule" and specific nucleic acid 
molecule copies of the template molecule amplified in a polymerase chain reaction. 

The opposing primer molecules are selected such that they are each capable of 
hybridising to complementary strands of the same template molecule, wherein DNA 
10 polymerase-dependant DNA synthesis occurring from a first opposing primer molecule 
will be in a direction toward the second opposing primer molecule. 

Accordingly, both primers hybridise to said template molecule such that, in the presence 
of a DNA polymerase enzyme, a cofactor and appropriate substrate, DNA synthesis 
occurs in the 5' to 3' direction from each primer molecule towards the position on the 
DNA where the other primer molecule is hybridised, thereby amplifying the intervening 
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DNA 



Those skilled in the art are aware of the technical requirements of the polymerase chain 
20 reaction and are capable of any modifications which may be made to the reaction 
conditions. For example, of the polymerase chain reaction may be used in any suitable 
format, such as amplified fragment length polymorphism (AFLP), single-strand chain 
polymorphism (SSCP), amplification and mismatch detection (AMD), interspersed 
repetitive sequence polymerase chain reaction (IRS-PCR), inverse polymerase chain 
25 reaction (iPCR) and reverse transcription polymerase chain reaction (RT-PCR), amongst 
others, to isolate a related a-N-acetylglucosaminidase gene sequence or identify a 
mutation in an a-N-acetylglucosaminidase genetic sequence. Such variations of the 
polymerase chain reaction are discussed in detail by McPherson et al (1991), which is 
incorporated herein by reference. The present invention encompasses all such variations, 
30 the only requirement being that the final product of the reaction is an isolated nucleic acid 
molecule which is capable of encoding a-N-acetylglucosaminidase or a homologue, 



analogue or derivative thereof. 
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In a preferred embodiment, the first primer molecule is preferably derived from the sense 
strand of a gene which encodes o-N-acetylglucosaminidase, in particular from the 
5 nucleotide sequence set forth in SEQ ID NO.l or SEQ ID NO:3 or a homologue, 
derivative or analogue thereof and the second primer molecule is preferably derived from 
the antisense strand of said gene. 

Those skilled in the art will be aware that it is not essential to the performance of the 
10 invention that the primer molecules be derived from the same gene. 
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According to this embodiment, the nucleic acid primer molecule may further consist of 
a combination of any of the nucleotides adenine, cytidine, guanine, thymidine, or 
inosine, or functional analogues or derivatives thereof, capable of being incorporated into 
a polynucleotide molecule provided that it is capable of hybridising under at least low 
stringency conditions to the nucleic acid molecule set forth in SEQ ID NO:l or SEQ ID 
NO:3.or_-a,hpmoIogue, analogue,or derivative-thereof.- - — 

The nucleic acid primer molecules may further be each contained in an aqueous pool 
20 comprising other nucleic acid primer molecules. More preferably, the nucleic acid 
primer molecule is in a substantially pure form. 

The nucleic acid template molecule may be in a recombinant form, in a virus particle, 
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the related 
genetic.sequence originates from a ceU,_tissue, or organ derived from an animal species 
or a human. More preferably, the related genetic sequence originates from a cell, tissue, 
or organ derived from a human. 

Accordingly, a third aspect of the present invention extends to an isolated nucleic acid 
30 molecule which is at least 40 % identical to the nucleotide sequence set forth in SEQ ID 
NO:l or SEQ ID NO:3 or to a complementary strand thereof or a homologue, analogue 



or derivative thereof. 
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Preferably, the percentage identity to SEQ ID NO:l or SEQ ID NO:3 is at least about 
55%, still more preferably at least about 65%, yet still more preferably at least about 75- 
5 80% and even still more preferably at least about 85-95%. 

In an even more preferred embodiment, the present invention provides an isolated 
nucleic acid molecule which is at least 40% identical to the nucleotide sequence set forth 
in SEQ ID NO:l or SEQ ID NO:3 or to a complementary strand thereof or a homologue, 
10 analogue or derivative thereof and is capable of hybridising under at least low stringency 
conditions to a nucleotide sequence set forth in SEQ ID NO: 1 or SEQ ID NO:3. 

In a particularly preferred embodiment, the isolated nucleic acid molecule described 
herein is further capable of encoding a sequence of amino acids which is capable of 
15 carrying out the enzyme reaction catalysed by a a-N-acetylglucosaminidase enzyme. 



The isolated nucleic acid molecule of the present invention is also useful for developing 
a genetic construct comprising a sense molecule, for the expression or over-expression 
of a-N-acetylglucosaminidase in prokaryotic or eukaryotic cells. Particularly preferred 
20 eukaryotic cells include CHO cells but may be in any other suitable mammalian cells or 
cell lines or non-mammalian cells such as yeast or insect cells. 

The term "sense molecule" as used herein shall be taken to refer to an isolated nucleic 
acid molecule of the invention as described herein, which is provided in a format suitable 
25 forJts expression to produce a recombinant polypeptide, -when said sense molecule is 
introduced into a host cell. 

In a particularly preferred embodiment, a sense molecule which encodes the a-N- 
acetylglucosaminidase comprises a sequence of nucleotides set forth in SEQ ID NO: 1 or 
30 SEQ ID NO:3 or a complementary strand, homologue, analogue or derivative thereof. 
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In a most particularly preferred embodiment, the sense molecule of the invention 
comprises the sequence of nucleotides set forth in SEQ ID NO:l or a complementary 
strand, homologue, analogue or derivative thereof. 

5 Those skilled in the art will be aware that expression of a sense molecule may require the 
nucleic acid molecule of the invention to be placed in operable connection with a 
promoter sequence to produce a "sense construct". The choice of promoter for the 
present purpose may vary depending upon the level of expression of the sense molecule 
required and/or the tissue-specificity or developmental-specificity of expression of the 
10 sense molecule which is required. The sense construct may further comprise a terminator 
sequence and be introduced into a suitable host cell where it is capable of being expressed 
to produce a recombinant polypeptide gene product. 

In the context of the present invention, a sense molecule which corresponds to a genetic 
15 sequence or isolated nucleic acid molecule which encodes a-N-acetylglucosaminidase 
^^polypeptide or a homologue, analogue or- derivative thereof, placed operably under the 
control of a suitable promoter sequence, is introduced into a cell using any suitable 
method for the transformation of said cell and said genetic sequence or isolated nucleic 
acid molecule is expressed therein to produce said polypeptide. 
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The present invention clearly extends to genetic constructs designed to facilitate 
expression of any nucleic acid molecule described herein. 

A genetic construct of the present invention comprises the foregoing sense molecule, 
25 -Placed^perably^ of regulating the 

expression of the said nucleic acid molecule in a prokaryotic or eukaryotic cell, 
preferably a mammalian cell such as a CHO cell, a yeast cell, insect cell or bacterial cell' 
The said genetic construct optionally comprises, in addition to a promoter and sense 
molecule, a terminator sequence. 

30 

The term "terminator" refers to a DNA sequence at the end of a transcriptional unit 
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which signals termination of transcription. Terminators are 3 '-non-translated DNA 
sequences containing a polyadenylation signal, which facilitates the addition of 
polyadenylate sequences to the 3'-end of a primary transcript. Terminators active in 
plant cells are known and described in the literature. They may be isolated from 
5 bacteria, fungi, viruses, animals and/or plants. 

Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA box 
which is required for accurate transcription initiation, with or without a CCAAT box 

10 sequence and additional regulatory elements (i.e. upstream activating sequences, 
enhancers and silencers) which alter gene expression in response to developmental and/or 
external stimuli, or in a tissue-specific manner. A promoter is usually, but not 
necessarily, positioned upstream or 5', of a structural gene, the expression of which it 
regulates. Furthermore, the regulatory elements comprising a promoter are usually 

1 5 positioned within 2 kb of the start site of transcription of the gene. 

In the present context, the term "promoter " is also used to describe a synthetic or fusion 
molecule, or derivative which confers, activates or enhances expression of said sense 
molecule in a cell. 
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Preferred promoters may contain additional copies of one or more specific regulatory 
elements, to further enhance expression of the sense molecule and/or to alter the spatial 
expression and/or temporal expression of said sense molecule. For example, regulatory 
elements which confer copper inducibility may be placed adjacent to a heterologous 
25 promoter-sequence-drrving expression-of a sense molecule, thereby conferring copper 
inducibility on the expression of said molecule. 

Placing a sense molecule under the regulatory control of a promoter sequence means 
positioning the said molecule such that expression is controlled by the promoter 
30 sequence. Promoters are generally positioned 5' (upstream) to the genes that they 
control, in the construction of heterologous promoter/structural gene combinations it is 
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generally preferred to position the promoter at a distance from the gene transcription start 
site that is approximately the same as the distance between that promoter and the gene 
it controls in its natural setting, i.e., the gene from which the promoter is derived. As 
is known in the art, some variation in this distance can be accommodated without loss of 
5 promoter function. Similarly, the preferred positioning of a regulatory sequence element 
with respect to a heterologous gene to be placed under its control is defined by the 
positioning of the element in its natural setting, i.e., the genes from which it is derived. 
Again, as is known in the art, some variation in this distance can also occur. 

10 Examples of promoters suitable for use in genetic constructs of the present invention 
include viral, fungal, bacterial, animal and plant derived promoters capable of 
functioning in animal, human, yeast, insect or bacterial cells. The promoter may 
regulate the expression of the said molecule constitutively, or differentially with respect 
to the tissue in which expression occurs or, with respect to the developmental stage at 

1 5 which expression occurs, or in response to external stimuli such as physiological stresses, 
— or-plant-pathogensi or metal-ions, -amongst others. Preferably, ithe promoter is capable 
of regulating expression of a sense molecule in a cell derived from an animal species or 
human. 

20 In a particularly preferred embodiment, the promoter is derived from the genomic gene 
encoding a-N-acetylglucosaminidase, preferably the human a-N-acetylglucosaminidase 
gene. In a more preferred embodiment, however, the promoter is derived from the 
nucleotide sequence set forth in SEQ ID NO:3 or is at least capable of hybridising to 
nucleotide residues 1 to 989 of SEQ ID NO:3 or at least 20 contiguous nucleotides 

25 derivedtherefrom. - 

In an even more particularly preferred embodiment, the promoter is the CMV promoter 
sequence or a promoter sequence derived therefrom. 
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An alternative embodiment of the invention is directed to a genetic construct comprising 
a promoter or functional derivative, part fragment, homologue, or analogue thereof, 
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derived from me a-N-acetylglucosaminidase genomic gene defined by SEQ ID NO: 3. 

Preferably, said genetic construct further comprises the a-N-acetylglucosaminidase 
sequence defined by SEQ ID NO:l placed in operably connection with said promoter. 

A further aspect of the present invention is directed to synthetic a-N-acetylglucosaminidase 
or like molecule. 

The term "synthetic" as used herein shall be taken to include both recombinant and 
10 chemically-synthesised molecules produced by the sequential addition of amino acid 
residues or groups of amino acid residues in defined order. - 

In one embodiment, the invention relates to recombinant a-N-acetylglucosaminidase or 
like molecule encoded by or expressed from the nucleic acid molecules as hereinbefore 
15 described. 
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In another embodiment me-syn^ ^ 
comprises a sequence of amino acids which is at least 40% identical to the amino acid 
sequence set forth in any one of SEQ ID Nos:2, 4, 5 or 6. 

More preferably, the percentage identity is at least 60% and still more preferably at least 
80% or 85-90%. 



A particularly preferred embodiment of the present invention provides a synthetic «-N- 
25 -acetylglucosaminidase as hereinbefore defined which comprises a sequence of amino acids 
substantially as set forth in any one of SEQ ID Nos:2, 4, 5 or 6 or a homologue, analogue 
or derivative thereof. 

For the purposes of nomenclature, the amino acid sequence set forth in SEQ ID NO 2 
30 comprises the full-length translation product of the human «-N-acetylgJucosaminidase gene 
(i.e. hereinafter referred to as the "a-N-acetylglucosaminidase polypeptide" or "SEQ ID 



4 



in 
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N0:2") produced by expression of either the cDNA sequence defined by SEQ ID NO. l 
or the genomic gene defined by SEQ ID NO:3. The a-N-acetylglucosaminidase 
polypeptide comprises at least seven potentially-glycosylated Asn residues, at positions 
261, 272, 435, 503, 513, 526 and 532. Furthermore, the amino acid sequence of the o-N- 
5 acetylglucosaminidase polypeptide may comprise a signal peptide of approximately 23 
amino acid residues in length, with a probable site for signal peptide peptidase cleavage 
occurring between Gly a and Asp^. 

The amino acid sequences set forth in SEQ ID Nos:4-6 relate to N-terminal and internal 
10 (i.e. CNBr) amino acid sequences derived from human a-N-acetylglucosaminidase, 
purified as described in Example 1. As described in Example 2, the purified form of the 
enzyme comprises two polypeptides having approximate molecular weights of 82 and 77 
kDa. The sequence set forth in SEQ ID NO:4 relates to the N-terminal sequence of the 82 
kDa polypeptide, while SEQ ID NO:5 relates to the N-terminal sequence of the 77 kDa 
polypeptide. Furthermore, SEQ ID NO:4 comprises amino acids residues 24-43 of SEQ 
ID NO:2, while SEQ ID NO:5 comprises amino acid residues 59-76 of SEQ ID NO:2. 
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The amino acid sequence defined by SEQ ID NO:6 relates to the CNBr-cleaved peptide 
of purified human a-N-acetylglucosaminidase. This amino acid sequence aligns with 
20 amino acid residues 540-554 of the a-N-acetylglucosaminidase polypeptide (SEQ ID 
NO:2). 



In the present context, "homologues" of a polypeptide refer to those polypeptides, 
enzymes or proteins which have a similar a-N-acetylglucosaminidase enzyme activity, 
notwithstanding any amino acid substitutions, additions or deletions thereto. A 
homologue may be isolated or derived from the same or another animal species. 



25 



Furthermore, the amino acids of a homologous polypeptide may be replaced by other 
amino acids having similar properties, for example hydrophobicity, hydrophilicity, 
30 hydrophobic moment, charge or antigenicity, and so on. 
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" Analogues" encompass a-N-acetylglucosaminidase polypeptides and peptide derivatives 
thereof notwithstanding the occurrence of any non-naturally occurring amino acid 
^analogues .therein.: __. 

The term "derivative" in relation to the polypeptides of the invention refer to mutants, parts 
or fragments of a ftmc^onal .moleaile._Derivatives„include^modified~peptides in which 
ligands are attached to one or more of the amino acid residues contained therein, such 
as carbohydrates, enzymes, proteins, polypeptides or reporter molecules such as 
radionuclides or fluorescent compounds. Glycosylated, fluorescent, acylated or alkylated 
forms of the subject peptides are particularly contemplated by the present invention. 
Additionally, derivatives of a polypeptide may comprise fragments or parts of an amino 
acid sequence disclosed herein and are within the scope of the invention, as are 
homopolymers or heteropolymers comprising two or more copies of the subject 
polypeptides. Procedures for derivatizing peptides are well-known in the art. 

Accordingly, this aspect of the present invention is directed to any proteinaceous molecule 
comprising an amino^acid sequence coirespondi^ mammalian <x-N- 

acetylglucosaminidase enzyme or to a like molecule. The like molecule, therefore, 
comprises parts, derivatives and/or portions of the a-N-acetylglucosaminidase enzyme 
whether functional or not 

Preferably, the mammal is human but may be of non-human origin as contemplated above. 

The synthetic or recombinant a-N-acetylglucosaminidase of the present invention may 
comprise an amino acid sequence corresponding to the naturally occurring amino acid 
sequence or may contain single or multiple amino acid substitutions, deletions and/or 
additions. The length of the amino acid sequence may range from a few residues to a full 
length molecule. 

Amino acid substitutions are typically of single residues. Amino acid insertions will 
usually be in the order of about 1-10 amino acid residues and deletions will range from 
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about 1-20 residues. Preferably, deletions or insertions are made in adjacent pairs, i.e. a 
deletion of two residues or insertion of two residues. 

Amino acid insertional derivatives of o-N-acetylglucosaminidase of the present invention 
5 include amino and/or carboxyl terminal fusions as well as intra-sequence insertions of 
single or multiple amino acids. Insertional amino acid sequence variants are those in which 
one or more amino acid residues are introduced into a predetermined site in the protein 
although random insertion is also possible with suitable screening of the resulting product. 
Deletional variants are characterised by the removal of one or more amino acids from the 
10 sequence. Substitutional amino acid variants are those in which at least one residue in the 
sequence has been removed and a different residue inserted in its place. Typical 
substitutions are those made in accordance with the following Table 2: 

The amino acid variants referred to above may readily be made using peptide synthetic 
15 techniques well known in the art, such as solid phase peptide synthesis (Merrifield 
synthesis) and the like, or by recombinant DNA manipulations. Techniques for making 
substitution mutations at predetermined sites in DNA having known or partially known 
sequence are well known and include, for example, M13 mutagenesis. The manipulation 
of DNA sequence to produce variant proteins which manifest as substitutional, insertional 
20 or deletional variants are conveniently elsewhere described such as Sambrook et al, 1989 
Molecular Cloning: A Laboratory Manual Cold Spring Harbor Laboratories, Cold Spring 
Harbor, NY. 

The derivatives or like molecules include single or multiple substitutions, deletions and/or 
25 additions of any components) naturally or artificially associated with the o-N- 
acetylglucosaminidase enzyme such as carbohydrate, lipid and/or other proteinaceous 
moieties. For example, the present invention extends to glycosylated and non-glycosylated 
forms of the molecule. All such molecules are encompassed by the expression "mutants", 
"derivatives", "fragments", "portions" and "like" molecules. These molecules may be 
30 active or non-active and may contain specific regions, such as a catalytic region. 
Particularly, preferred derivative molecules include those with altered glycosylation 
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patterns relative to the naturally occurring molecule. Even more particularly, the 
recombinant molecule is more highly glycosylated than the naturally occurring molecule. 
Such highly glycosylated derivatives may have improved take-up properties and enhanced 
half-lives. 

„As„indicated^in_„the„„Examples,™the_-molecular— weight of ^purified human a-N- 
acetylglucosaminidase (i.e. 82kDa and 77kDa) and recombinant mammalian o-N- 
acetylglucosaminidase produced in CHO cells (i.e. 89 kDa and 79 kDa) are greater than 
the deduced molecular weight of the a-N-acetylglucosaminidase polypeptide set forth in 
SEQ ID No:2 (i.e. 70 kDa), suggesting that the purified and recombinant polypeptide are 
post-translationally modified. The data presented in Example 8 indicate further that the 
recombinant a-N-acetylglucosaminidase enzyme produced in CHO cells, at least, is 
glycosylated and that the difference in molecular weight determined for the recombinant 
polypeptides and the polypeptide of SEQ ID No: 2 is due almost entirely to glycosylation 
of the recombinant polypeptide by CHO cells. As shown in Example 9, the glycosylated 
recombinant a-N-acetylglucosaminidase polypeptide exhibits enzymatic activity. 

The present invention also extends to synthetic a-N-acetylglucosaminidase or like 
molecules when fused to other proteinaceous molecules. The latter may include another 
enzyme, reporter molecule, purification site or an amino acid sequence which facilitates 
transport of the molecule out of a cell, such as a signal sequence. 

The present invention extends further to post-translational modifications to the a-N- 
acetylglucosaminidase enzyme. The modifications may be made to the naturally occurring 
enzyme or following synthesis by recombiMnrtechniques. The modifications may be at 
the structural level or at, for example, the electrochemical level such as modifying net 
charge or structural conformation of the enzyme. 

Such modification may be important to facilitate entry or penetration of the enzyme into 
selected tissues such as cartilage or blood brain barriers or to increase circulation half-life. 
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Analogues of o-N-acetylglucosaminidase contemplated herein include, but are not limited 
to, modifications to side chains, incorporation of unnatural amino acids and/or their 
derivatives during peptide synthesis and the use of crosslinkers and other methods which 
impose conformational constraints on the enzyme. 

5 

Examples of side_chain_modifications_contemplated_ by_the- present invention include 
modifications of amino groups such as by reductive alkylation by reaction with an 
aldehyde followed by reduction with NaBH«; amidination with methylacetimidate; 
acylation with acetic anhydride; carbamoylation of amino groups with cyanate; 
10 trinitrobenzylation of amino groups with 2, 4, 6-trinitrobenzene sulphonic acid (TNBS); 
acylation of amino groups with succinic anhydride and tetrahydrophthalic anhydride; and 
pyridoxylation of lysine with pyridoxal-5 '-phosphate followed by reduction with NaBH*. 

The guanidino group of arginine residues may be modified by the formation of 
15 heterocyclic condensation products with reagents such as 2,3-butanedione, phenylglyoxal 
— — andglyoxal- 



The carboxyl group may be modified by carbodiimide activation via O-acylisourea 
formation followed by subsequent derivatisation, for example, to a corresponding amide. 

20 

Sulphydryl groups may be modified by methods such as carboxymethylation with 
iodoacetic acid or iodoacetamide; performic acid oxidation to cysteic acid; formation of 
a mixed disulphides with other thiol compounds; reaction with maleimide, maleic 
anhydride or other substituted maleimide; formation of mercurial derivatives using 4- 
25 chloromercuribenzoate, 4-chloromercuriphenylsulphonic acid, phenylmercury chloride, 2- 
chloromercuric-4-nitrophenol and other mercurials; carbamoylation with cyanate at 
alkaline pH. 

Tryptophan residues may be modified by, for example, oxidation with N- 
30 bromosuccinimide or alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide 
or sulphenyl halides. Tyrosine residues on the other hand, may be altered by nitration with 
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tetranitromethane to fonn a 3-nitrotyrosine derivative. 



-Modificauon-of-me-imidazole-r^g- by 
alkylation with iodoacetic acid derivatives or N-carbethoxylation with 
5 diethylpyrocarbonate. 



Examples of incorporating unnatural amino acids and derivatives during peptide synthesis 
include, but are not limited to, use of norleucine, 4-amino butyric acid, 4-amino-3- 
hydroxy-5-phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, 
10 phenylglycine, ornithine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl 
alanine and/or D-isomers of amino acids. Non-naturaUy occurring amino acids 
contemplated by the present invention are incorporated herein, as Table 3. 

Crosslinkers can be used, for example, to stabilise 3D conformations, using homo- 
15 _bjfifflctio^ciosslinkers.such as.me bifunctiond inudo estere-havmg-(GH^ tf spacer groups 
_ - -with-n=l-to-n=6, glutaraldehyde,-N-hydroxysuccinimide esters and hetero-bifunctional 
reagents which usually contain an amino-reactive moiety such as N-hydroxysuccinimide 
and another group specific-reactive moiety such as maleimido or dithio moiety (SH) or 
carbodiimide (COOH). In addition, the enzyme could be conformationally constrained by, 
20 for example, incorporation of C„ and N.-methylamino acids, introduction of double bonds 
between C. and C, atoms of amino acids and the formation of cyclic peptides or analogues 
by introducing covalent bonds such as forming an amide bond between the N and C 
termini, between two side chains or between a side chain and the N or C terminus. 

25 Electrochemical modifications of a-N-acetylglucosaminidase include interaction with 
polylysine or polyethylene glycol or other agent which effects an overall change to the net 
charge of the enzyme. 

Advantageously, the recombinant a-N-acetylglucosaminidase is a biologically pure 
30 preparation meaning that it has undergone some purification away for other proteins and/or 
non-proteinaceous material. The purity of the preparation may be represented as at least 
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40% of the enzyme, preferably at least 60%, more preferably at least 75%, even more 
preferably at least 85% and still more preferably at least 95% relative to non-a-N- 
acetylglucosam inidase material as determined by_weight,_activity, amino- acid homology 



or similarity, antibody reactivity or other convenient means. 
5 

Particul arly preferred methods for the pre paration and purification of recombinant a -N- 

acetylglucosaminidase are provided in Examples 7 and 8. 

Those skilled in the art will be aware of the means of purifying a synthetic or recombinant 
1 0 o-N-acetylglucosaminidase from several sources without undue experimentation and for 
expressing the degree of purity of such a purified preparation of the enzyme. 

The present invention further contemplates antibodies to a-N-acetylglucosaminidase and 
preferably synthetic a-N-acetylglucosaminidase or like molecule. The antibodies may be 
15 p olyclonal or monoclonal, naturally occurrin g or synthetic (including recombinant, 
_ fragment or fusion forms). Such antibodies will be useful in developing immunoassays for 
a-N-acetylglucosaminidase and for identifying additional genetic sequences which are 
capable of expressing a-N-acetylglucosaminidase polypeptides or homologues, analogues 
or derivatives thereof. 
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Both polyclonal and monoclonal antibodies are obtainable by immunisation with an 
appropriate synthetic or recombinant gene product, or epitope, or peptide fragment of a 
gene product, in particular a a-N-acetylglucosaminidase polypeptide or a homplogue, 
analogue or derivative thereof. 

Alternatively, fragments of antibodies may be used, such as Fab fragments. The present 
invention extends further to encompass recombinant and synthetic antibodies and to 
antibody hybrids. A "synthetic antibody" is considered herein to include fragments and 
hybrids of antibodies. 

A further aspect of the present invention contemplates a method of screening for mutations 
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or other abberations in the a-N-acetylglucosaminidase gene in a human or animal patient. 

Such a method may be accomplished in a number of ways including isolating a source of 
DNA-to be tested or mRNA-therefrom and-hybridising thereto a nucleic acid molecule as 

hereinbefore described. Generally, the nucleic acid is probe or primer size and polymerase 
5 chain reaction is a convenient means by which to analyse the RNA or DNA. Other suitable 

assays include the ligation chain reaction and the strand displacement amplification 

methods. The a-N-acetylglucosaminidase sequence can also be determined and compared 
to the naturally occurring sequence— Such methods may be useful in adults and children 

and may be adapted for a pre-natal test The DNA to be tested includes a genomic sample 
1 0 carrying the a-N-acetylglucosaminidase gene, a cDNA clone and/or amplification product. 

In accordance with this aspect of the present invention there is provided a method for 
screening for abberations in the a-N-acetylglucosaminidase gene including the absence of 
such a gene or a portion or a substantial portion thereof comprising isolating a sample of 

!5 DNA or mRNA_con;esponding_to_a-region-of-said_DNA-and-contactm with an 

oligonucleotide probe-capable of hybridising tooneor morecomplementary sequences 

within the a-N-acetylglucosaminidase gene and then detecting the hybridisation, the extent 
of hybridisation or the absence of hybridisation. 

20 Alternatively, the probe is a primer and capable of directing amplification of one or more 
regions of said a-N-acetylglucosaminidase gene and the amplification products and/or 
profile of amplification products is compared to an individual carrying the fiill gene or to 
a reference date base. 

25 Conveniently, the amplification products are sequenced to determine the presence or 
absence of the full gene. 

The present invention extends to the use of any and all DNA-based or nucleic acid-based 
hybridisation and/or polymerase chain reaction formats as described herein, for the 
30 diagnosis of a disorder involving the a-N-acetylglucosaminidase gene in a human or 
animal patient 
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The present invention further extends to a method of treating patients suffering from a-N- 
acetylglucosaminidase deficiency, such as in MPS-HIB, said method comprising 
administering to said patient an effective amount of o=N=acetylglucosaminidase or-active 
like form thereof. 

Preferably, the a-N-acetylglucosaminidase is in recombinant form. Such a method is 
referred to as "enzyme therapy". Alternatively, gene therapy can be employed including 
introducing an active gene (i.e. a nucleic acid molecule as hereinbefore described) or to 
parts of the gene or other sequences which facilitate expression of a naturally occurring o- 
N-acetylglucosaminidase gene. 

Administration of a-N-acetylglucosaminidase for enzyme therapy may be by oral, 
intravenous, suppository, intraperitoneal, intramuscular, intranasal, intradermal or 
subcutaneous administration or by infusion or implantation. The a-N- 
acetylglucosaminidase is preferably as hereinbefore described including active mutants or 
-derivatives th^wf Md^ 5 y way 

of gene therapy include of the gene in viral vectors 

which are introduced into the animal (e.g. human) host to be treated. Alternatively, the 
gene may be expressed in a bacterial host which is then introduced and becomes part of the 
bacterial flora in the animal to be tested. 

Still yet another aspect of the present invention is directed to a pharmaceutical composition 
comprising synthetic (e.g. recombinant) a-N-acetylglucosaminidase or like molecule, 
including active derivatives and fragments thereof, alone or in combination with other 
active molecules. Such other molecules may act synergistically with the enzyme or 
facilitates its entry to a target cell. The composition will also contain one or more 
pharmaceutically acceptable carriers and/or diluents. The composition may alternatively 
comprise a genetic component useful in gene therapy. 

The active ingredients of the pharmaceutical composition comprising the synthetic or 
recombinant a-N-acetylglucosaminidase or mutants or fragments or derivatives thereof are 
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contemplated to exhibit excellent activity in treating patients with a deficiency in the 
enzyme when administered in an amount which depends on the particular case. The 
— -variation-dependsrfor example,-on the patient-and-the a^N^acetylglucosaminidase used. 

For example, from about 0.5 ug to about 20 mg of enzyme per animal body or, depending 
5 on the animal and other factors, per kilogram of body weight may be administered. Dosage 
— regima may-be adjusted-to provide the optimumrtherapeutic response. For example, several 
divided doses may be administered daily, weekly, monthly or in other suitable time 
intervals or the dose may be proportionally reduced as indicated by the exigencies of the 
situation. Accordingly, alternative dosages in the order of 1.0 fj.g to 15 mg, 2.0 yug to 10 
10 mg or lO^zg to 5mg may be administered in a single or as part of multiple doses. The 
active compound may be administered in a convenient manner such as by the oral, 
intravenous (where water soluble), intramuscular, subcutaneous, intranasal, intradermal or 
suppository routes or implanting (eg using slow release molecules). Depending on the route 
of administration, the active ingredients which comprise a synthetic (e.g. recombinant) o- 
1 5 N-acetylglucosaminidase or^fragments^derivatives or-mutants thereofmay be required to 
—bexoated in a^aterial^toprm and other natural 

OTnditibm which may inactivate said ingredients. For example, the low lipophilicity of o- 
N-acetylglucosaminidase will allow it to be destroyed in the gastrointestinal tract by 
enzymes capable of cleaving peptide bonds and in the stomach by acid hydrolysis. In order 
20 to administer the vaccine by other than parenteral administration, the enzyme will be 
coated by, or administered with, a material to prevent its inactivation. For example, the 
enzyme may be administered in an adjuvant, co-administered with enzyme inhibitors or 
in liposomes. Adjuvant is used in its broadest sense and includes any immune stimulating 
compound such as interferon. Adjuvants contemplated herein include resorcinols, non- 
25 ionic surfactants such as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether. 
Conveniently, the adjuvant is Freund's Complete or Incomplete Adjuvant. Enzyme 
inhibitors include pancreatic trypsin inhibitor, diisopropylfluorophosphate (DEP) and 
trasylol. Liposomes include water-in-oil-in-water emulsions as well as conventional 
liposomes. 

30 

The active compound may also be administered in dispersions prepared in glycerol, liquid 
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polyethylene glycols, and/or mixtures thereof and in oils. Under ordinary conditions of 
storage and use, these preparations contain a preservative to prevent the growth of 
m icroorganisms- . 

5 The pharmaceutical forms suitable for injectable use include sterile aqueous solutions 
(where water soluble)., or_djsp_ersions and steriIe_powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. In all cases the form must be sterile 
and must be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the contaminating 
10 action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, 
and vegetable oils. The proper fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the required particle size in the case of 
1 5 dispersion and by the use of superfactants. The preyentionof Jhe.action of microorganisms 
_i-=can-be-brought about-hy-various antibacterial and antifungal agentspfor example, parabens, 
chiorobutanol, phenol, sorbic acid, thirmerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged 
absorption of the injectable compositions can be brought about by the use in the 
20 compositions of agents delaying absorption, for example, aluminum monostearate and 
gelatin. 

Sterile injectable solutions are prepared by incorporating the active compound in the 
required amount in the appropriate solvent with various of the other ingredients 

25 enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
are prepared by incorporating the various sterilized active ingredient(s) into a sterile 
vehicle which contains the basic dispersion medium and the required other ingredients 
from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and the 

30 freeze-drying technique which yield a powder of the active ingredient plus any additional 
desired ingredient from previously sterile-filtered solution thereof. 
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When the a-N-acetylglucosaminidase of the present invention is suitably protected as 
described above, the composition may be orally administered, for example, with an inert 
^ i dilu«t_orj8ath an assimilable_edible.carrier,.or it may_be-enclosed-in hard or soft shell 
gelatin capsule, or it may be compressed into tablets, or it may be incorporated directly 
5 with the food of the diet For oral therapeutic administration, the active compound may be 
. ... incor P° m ed with excipients and used wjhe form of ingestible tablets, buccal tablets, 
troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and 
preparations should contain at least 1% by weight of active compound. The percentage of 
the compositions and preparations may, of course, be varied and may conveniently be 
1 0 between about 5 to about 80% of the weight of the unit. The amount of active compound 
in the vaccine compositions is such that a suitable dosage will be obtained. Preferred 
compositions or preparations according to the present invention are prepared, so that an 
oral dosage unit form contains between about 0.5 ug and 20 mg of active compound. 

15 The tablets, troches , pills, capsules and the like ma y a lso contain the following- a binder 
such- as gum gragacanth, acacia, com starch or gelatin; excipients such as dicalcium 
phosphate; a disintegrating agent such as corn starch, potato starch, alginic acid and the 
like; a lubricant such as magnesium stearate; and a sweetening agent such a sucrose, lactose 
or saccharin may be added or a flavoring agent such as peppermint, oil of wintergreen, or 

20 cherry flavouring. When the dosage unit form is a capsule, it may contain, in addition to 
materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, 
pills, or capsules may be coated with shellac, sugar or both. A syrup or elixir may contain 
the active compound, sucrose as a sweetening agent, methyl and propylparabens as 

25 preservatives, a dye and flavoring such as cherry or orange flavor. Of course, any material 
used in preparing any dosage unit form should be pharmaceutically pure and substantially 
non-to xic in the amounts employed. In addition, the active compound may be incorporated 
into sustained-release reparations and formulations. 
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As used herein "pharmaceutically acceptable carriers and/or diluents" include any and all 
solvents, dispersion media, aqueous solutions, coatings, antibacterial and antifungal agents, 
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isotonic and absorption delaying agents, and the like. The use of such media and agents for 
pharmaceutical active substances is well known in the art. Except insofar as any 
conventional media or agent is incompatible with the active ingredient, use thereof in the 
pharaiaceutical compositions is contemplated. Supplementary active ingredients can also 
be incorporated into the compositions. 

The present invention further relates to the use of a-N-acetyiglucosaminidase or active 
fragment, mutant or derivative thereof in the manufacture of a medicament for the 
treatment of patients suffering from a deficiency in the naturally occurring enzyme (e.g. 
MPS-mB). 

The present invention is further described with reference to the following non-limiting 
Examples. 

^ EXAMPLE 1 — — 

Purificationof a-N-A^ 

a-N-acetylglucosaminidase was purified according to the method described in Weber et 
al. (1996). Enzyme was purified to homogeneity from human placenta. Evidence of 
purity is shown following SDS/PAGE which is represented in Figure 1. All samples were 
reduced with dithiothreitol prior to electrophoresis. 

EXAMPLE 2 
Characterisation of a-N-Acetylglucosaminidase 

Results presented in Figure 1 show two polypeptides of about 82kDa and 77 kDa 
molecular weight, which correspond to a-N-acetylglucosaminidase polypeptides purified 
from human placenta according to Example 1. 

EXAMPLE 3 
Amino Acid Sequence Determination 
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The N-terminal amino acid sequences the 77 kDA and 82 kDa a-N-acetylglucosaminidase 
polypeptides^in addition to'the amino ^ cleavage product 

of these peptides, were determined using the methods of Weber et aL ( 1 996). 



The amino acid sequences are shown in~Table 4t 
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TABLE 4 

N-Terminal amino acid sequences (SEQ ID NO:4 and SEQ ID No:5) and 
— -GNBr peptide sequence (SEQ n> No:6) determined from purified human 

a-N-Acetylglucosaminidase 

5 



polypeptide 82 kDa 
polypeptide 77 kDa 

10 

CNBr peptide 



X 1 no residue could be identified for_this position, indicating that this residue could be 
15 phosphorylated or-glycosylated. — - 



DEAREAAAVRALVARLLGPG (^^J^N^T) 
KPGLDTYSLGGGGAAX 1 VR (_f&&, \ D MO I ^) 
WRLLLTSAPSLX 1 TX I P (j&^T>^0^ 
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EXAMPLE 4 
Cloning of a-N-Acetylglucosaminidase cDNA 



Oligonucleotide probes were prepared based on the partial amino acid sequences obtained 
for the purified a-N-acetylglucosaminidase polypeptides (Example 3). The probes were 
subsequently used to screen a human peripheral blood leukocyte cDNA library An 
approximately 2.6 kbp cDNA clone was isolated encoding most of the sequence of human 
25 a-N-acetylglucosaminidase (SEQ ID NO: 1). 
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The remaining a-N-acetylglucosaminidase coding sequence was obtained from the 
nucleotide sequence of the corresponding genomic gene (SEQ ID NO:3), isolated by 
hybridisation to a human chromosome 17 library (Weber etal. 1996). 
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The complete open reading frame is 2232 nucleotides long and encodes a 743 (plus stop 
codon) amino acid protein. The predicted molecular mass of the longest mature protein 
jminus the 23 amino acid N-termin al signd P ,eptide),is.about-7.9 > 622daltons. 



5 The amino acid sequence of o-N-acetylglucosaminidase is shown in SEQ ID NO:2. The 
deduced molecular weight _ of the desired amino acid sequence of a-N- 
acetylglucosaminidase is approximately 70kDa. The probable site of signal peptide 
peptidase cleavage is between amino acids 23 and 24. There are seven potential N- 
glycosylation sites in the sequence. 
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The nucleotide sequence of the corresponding a-N-acetylglucosaminidase genomic gene 
(SEQ ID No:3) comprises 10380 bp including 889 bp of 5' upstream sequence 
corresponding to at least at part of the a-N-acetylglucosaminidase promoter sequence, in 
addition to the nucleotide sequences of introns I, H, IL IV, V, in addition to 1326 bp of 3'- 
15 untranslated sequence. 

EXAMPLE 5 
Construction of an expression vector comprising the 
a-N-Acetylglucosaminidase cDNA sequence 

20 

The cDNA insert of * clone pbl 33, containing bases 107 to 2575 of the o-N- 
acetylglucosaminidase cDNA was excised with EcoXI and subcloned into pBluescript E 
SK-(Stratagene). A 178 bp Xmal fragment (bases 1 to 178 of the a^N- 
acetylghacosaminidase cDNA) from cosmid sub-clone 6.3, containing the start codon, was 
25 cloned into the pBluescript subclone to produce a full-length cDNA sequence in addition 
to 101 bp of 5' non-translated sequence as well as 245 bp of 3' non-translated region 
including the polyadenylation-site, the polyA-tail and linkerDNA The full length cDNA 
was directionally cloned into the pCDNA3 expressionvector (Invitrogen) via the EcoRI and 
BamHI sites. 

30 



EXAMPLE 6 
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Expression of recombinant a-N-acetyiglucosaminidase 



-Chmese.Hamster_Ovary-(C^^ using the 

DOTAP transfection reagent (Boehringer Mannheim) according to the manufacturers 
5 instructions. Cells were grown in Ham's F12 medium, 10% (v/v) fetal calf serum, 
penicillin and streptomycin sulfate at 100 /zg/ml each. Cells were grown in nonselective 
medium for 48 h and then incubated in medium containing 750 /zg/ml G418 sulfate 
(Geniticin) until resistant colonies emerged. 

10 Single cell clones were grown up and 26 of them were tested for expression of recombinant 
o-N-acetylglucosaminidasewitha fluorogenic o-N-acetylglucosaminidase substrate, (i.e. 
N-acetylglucosamine o-linked to 4-methylumbelliferone) 

EXAMPLE 7 

15 Large scale o-N-acetylglucosaminidase production 

2 g of Cytodex 2 microcarrier beads were swollen in 250 ml of PBS for 3 h at 37*C with 
three changes of PBS and then autoclaved for 15 min at 120 "C (wet cycle). The beads 
were then rinsed with sterile growth medium (Coons/DMEM, 10% v/v fetal calf serum, 
20 penicillin and streptomycin sulfate at 100 /zg/ml each and 0.1% w/v Pluronic F68) and 
transferred into a Techne stirrer culture flask. The microcarrier beads were inoculated with 
seven confluent 175 flasks of the cell clone showing the highest expression of recombinant 
a-N-acetylglucosaminidase. Growth medium was added up to 200 ml and the culture 
incubated with a stirrer speed of 20 rpm to achieve an even distribution of cells on the 
25 beads. The cells were allowed to attach to the beads for 16 h at low speed then medium 
was added up to 500 ml and the stirrer speed increased to 30 rpm. After a growth phase 
of 48 to 72 h with daily aerating to allow gas exchange the beads were completely covered 
with cells and the medium was exchanged for production medium (Coons/DMEM, no fetal 
calf serum, penicillin and streptomycin sulfate at 100 /zg/ml each, 0.1% w/v Pluronic F68 
30 and 5 mM NH 4 C1). The glucose concentration was monitored daily and the medium 
replaced, when glucose fell below 5 mM every 203 days. The harvested medium contained 
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approximately 2 mg a-N-acetylglucosaminidase protein per dm 3 of production medium. 



EXAMPLE 8 

Purification of recombinant a-N-acetylglucosaminidase 

5 

Production medium was dialysed against 50 mM NaAc pH 5.5 and loaded onto a heparin- 
agarose column equilibrated in the same buffer. After washing with NaAc buffer and 
NaAc/50 mM NaCl the column was eluted with 75 mM NaCl in NaAc buffer. The eluate 
was dialysed against 20 mM Tris/HCl pH 7.5, loaded onto a DEAE Scphacel column, 
10 washed with 25 mM NaCl in 20 mM Tris/HCl and then eluted with 50 and 75 mM NaCl 
ig in 20 mM Tris/HCl respectively. 

5.2 

SAf - 

]fj SDS-PAGE of the two eluates showed two polypeptide bands associated with enzyme 

1^ activity with apparent molecular weights of 79 and 89 kDa. The smaller a-N- 

fil 

, " 15 acetylglucosaminidase was eluted predominantly in the 50 mM NaCl fraction whereas the 

*g 89 kDa a-N-acetylglucosaminidase polypeptide was enriched in the 75 mM NaCl fraction 

M (Fig. 2). — 

O 

The difference in apparent molecular weight of the recombinant a-N- 
20 acetylglucosaminidase polypeptides is due to the presence of additional carbohydrate side 
chains, since a digest with PNGase F, which cleaves off N-glycosylation groups, reduced 
both the 79 kDa and 89 kDa polypeptides to the polypeptide band having an apparent 
molecular weight of about 70 kDa (Fig. 2), which corresponds to the approximate 
molecular weight deduced from primary amino acid sequence data (SEQ ID No:2). 

"25 

EXAMPLE 9 

Characteristics of recombinant a-N-acetylglucosaminidase 

No differences were observed between the enzyme activities of the 79 and 89 kDa 
30 recombinant a-N-acetylglucosaminidase polypeptides produced in CHO cells according 
to Example 7 and 8. With the fluorogenic N-acetylglucosamine a-linked to 4- 
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methylumbelliferone (4-MU) substrate, the enzyme has a pH-optimum of 4.6 with a k M of 
5.34 mM and a of 3.97 x 10 6 pmol/min/mg. Towards a 3 H-labelled disaccharide 
substrate it should a.pHK>ptimumjof^ x 10 4 

pmol/min/mg. 

EXAMPLE 10 
Mutational analysis of Sanfilippo B patients 

Genomic DNA is isolated from cultivated skin fibroblasts of patients by extraction with 
Phenol/Chloroform and used to amplify the eight exons and adjacent intronic sequences 
individually by PCR 

Primer sequences used in the amplification reaction are readily determined from the 
nucleotide sequences of the a-N-acetylglucosaminidase cDNA and genomic clones, set 
forth in SEQj D_N o: 1 or SEQ ID No:3 . Am plification conditions are also readily 
determined without undue experimentation. Procedures for the design of PCR primers and 
amplification conditions are described in detail, for example, by McPherson et al (1991). 
Differences in the primary sequence can be identified by separating the PCR products on 
a polyacrylamide gel under non-denaturing conditions (SSCP gels). Base changes, 
insertions and deletions will lead to a different band pattern compared with the wildtype 
in most of the cases, which can be visualised either by autoradiography of the gel after 
labelling the DNA during the PCR or by staining unlabelled DNA in the gel with silver. 
PCR products which show a different band pattern are sequences to identify the change. 
Other patient samples can be tested for mutations and polymorphism that were found by 
-hybridisation with wildtype- and mutation-specific oligonucleotides (ASO). 

Those skilled in the art will appreciate that the invention described herein is susceptible to 
variations and modifications other than those specifically described. It is to be understood 
that the invention includes all such variations and modifications. The invention also 
includes all of the steps, features, compositions and compounds referred to or indicated in 
this specification, individually or collectively, and any and all combinations of any two or 
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more of said steps or features. 



Sis 
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